forked from Lightning-AI/pytorch-lightning
[2/4] Add DeviceStatsMonitor callback #2
Closed
…ightning into dstats_callback
* Document exceptions in ipu.py * Document exceptions in tpu.py * Document exceptions in gpu.py
…ightning into dstats_callback
12 tasks
…ightning into dstats_callback
…tly into `trainer.py` (Lightning-AI#9495) Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: thomas chaton <[email protected]>
* deprecate loggerbase.close * deprecate warning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add to changelog * fix import * fix import alphabetize * spacing? * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * copy-paste avoid pre-commit.ci? * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * literally match the other comment * unindent * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * suggest finalize instead of save * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update tests/loggers/test_base.py * format but to be formatted * Update pytorch_lightning/loggers/base.py * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update pytorch_lightning/loggers/base.py * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update pytorch_lightning/loggers/base.py Co-authored-by: Jirka Borovec <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ananthsub <[email protected]> Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: thomas chaton <[email protected]> Co-authored-by: Jirka Borovec <[email protected]>
* Fix gradient accumulation for `ShardedDataParallel` * Update changelog * Update pytorch_lightning/plugins/training_type/sharded.py * add test * Update test_sharded_plugin.py * Update test_sharded_plugin.py * Update test_sharded_plugin.py
… as a whole container (Lightning-AI#9642)
…10 (Lightning-AI#9649) Co-authored-by: Adrian Wälchli <[email protected]>
* reset metrics
Co-authored-by: Carlos Mocholí <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jirka Borovec <[email protected]>
Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: thomas chaton <[email protected]> Co-authored-by: Carlos Mocholí <[email protected]>
Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: Carlos Mocholi <[email protected]>
* wip * reset _notebooks * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * reset _notebooks * testing with mock * update test with mock * update test * update tests * update test * remove track_load_dataloader_calls * update last test * remove unused imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <[email protected]>
…g-AI#9563) Co-authored-by: Carlos Mocholi <[email protected]>
* fix accumulate_grad_batches on init * chlog * update error * move to callback connector * add test with callback * fix tests * Update pytorch_lightning/trainer/connectors/callback_connector.py Co-authored-by: Carlos Mocholí <[email protected]> * update ipu logic * rev * rev * rev * pls work * code review Co-authored-by: Rohit Gupta <[email protected]> Co-authored-by: Carlos Mocholí <[email protected]>
* unique filename * chlog * update tests
…ck (Lightning-AI#9615) * pt1 dir empty check * clean imports * bring back resolve mkdir? * original doc * warningcache * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cp callback after resolve * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * move global_zero check outside warn fn Co-authored-by: ananthsub <[email protected]> * move global_zero check outside warn fn 2 Co-authored-by: ananthsub <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ananthsub <[email protected]>
Lightning-AI#9691) * Deprecate LightningDistributed and keep logic in ddp/ddpSpawn directly * Deprecate LightningDistributed and keep logic in ddp/ddpSpawn directly * Deprecate LightningDistributed and keep logic in ddp/ddpSpawn directly * Deprecate LightningDistributed and keep logic in ddp/ddpSpawn directly * Apply suggestions from code review Co-authored-by: Adrian Wälchli <[email protected]> * Apply suggestions from code review Co-authored-by: ananthsub <[email protected]> * Deprecate LightningDistributed and keep logic in ddp/ddpSpawn directly * Update pytorch_lightning/distributed/dist.py Co-authored-by: ananthsub <[email protected]> * Deprecate LightningDistributed and keep logic in ddp/ddpSpawn directly * Apply suggestions from code review Co-authored-by: ananthsub <[email protected]> * Apply suggestions from code review * Apply suggestions from code review * Deprecate LightningDistributed and keep logic in ddp/ddpSpawn directly Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: ananthsub <[email protected]>
* update warnings * add tests * comments * Apply suggestions from code review * Apply suggestions from code review
…htning-AI#8989) * Deprecate `stochastic_weight_avg` from the `Trainer` constructor * Update CHANGELOG.md * Apply suggestions from code review Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: Kaushik B <[email protected]> Co-authored-by: Jirka Borovec <[email protected]> Co-authored-by: Rohit Gupta <[email protected]>
…I#9704) * dump global_step * add test * chlog
for more information, see https://pre-commit.ci
Closing in favor of Lightning-AI#9712
What does this PR do?
Adds the `DeviceStatsMonitor` callback, which calls the `get_device_stats` function in the Accelerator interface. `DeviceStatsMonitor` will replace `GPUStatsMonitor` and `XLAStatsMonitor`.
Fixes Lightning-AI#9032
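To make the intent concrete, here is a minimal sketch of how a callback like this could delegate to the accelerator. It is not the code from this PR: `FakeAccelerator`, `FakeLogger`, `FakeTrainer`, and `DeviceStatsMonitorSketch` are hypothetical stand-ins, the hook signature is simplified, and the stat names and values are made up. Only the name `get_device_stats` comes from the description above.

```python
from typing import Any, Dict, List, Tuple


class FakeAccelerator:
    """Hypothetical stand-in for an accelerator that exposes get_device_stats."""

    def get_device_stats(self, device: str) -> Dict[str, Any]:
        # A real accelerator would query the hardware; these values are invented.
        return {"memory_used_mb": 512.0, "utilization_pct": 37.0}


class FakeLogger:
    """Hypothetical logger that just stores what it is asked to log."""

    def __init__(self) -> None:
        self.records: List[Tuple[int, Dict[str, Any]]] = []

    def log_metrics(self, metrics: Dict[str, Any], step: int) -> None:
        self.records.append((step, dict(metrics)))


class FakeTrainer:
    """Hypothetical trainer holding the accelerator, logger, and step counter."""

    def __init__(self, accelerator: FakeAccelerator, logger: FakeLogger) -> None:
        self.accelerator = accelerator
        self.logger = logger
        self.global_step = 0


class DeviceStatsMonitorSketch:
    """Device-agnostic callback: it never inspects the hardware itself."""

    def on_train_batch_start(self, trainer: FakeTrainer, device: str) -> None:
        # Delegate to the accelerator, then forward the stats to the logger.
        stats = trainer.accelerator.get_device_stats(device)
        trainer.logger.log_metrics(stats, step=trainer.global_step)


def run_demo(num_batches: int = 3) -> List[Tuple[int, Dict[str, Any]]]:
    """Simulate a few training batches and return everything that was logged."""
    logger = FakeLogger()
    trainer = FakeTrainer(FakeAccelerator(), logger)
    monitor = DeviceStatsMonitorSketch()
    for _ in range(num_batches):
        monitor.on_train_batch_start(trainer, device="cuda:0")
        trainer.global_step += 1
    return logger.records


if __name__ == "__main__":
    for step, stats in run_demo():
        print(step, stats)
```

Because the callback only talks to `get_device_stats`, the same class could in principle serve GPU, TPU, or other accelerators, which is presumably why a single monitor can replace the device-specific ones.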
Does your PR introduce any breaking changes? If yes, please list them.
Before submitting
PR review
Anyone in the community is welcome to review the PR.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:
Did you have fun?
Make sure you had fun coding 🙃